Abstract



It is widely accepted that population genetics theory is the cornerstone of evolutionary 
analyses. Empirical tests of the theory, however, are challenging because of the complex 
relationships between space, dispersal, and evolution. Critically, we lack quantitative 
validation of the spatial models of population genetics. Here we combine analytics, 
on- and off-lattice simulations, and experiments with bacteria to perform quantitative
tests of the theory. We study two bacterial species, the gut microbe Escherichia coli 
and the opportunistic pathogen Pseudomonas aeruginosa, and show that spatio-genetic 
patterns in colony biofilms of both species are accurately described by an extension 
of the one-dimensional stepping-stone model. We use one empirical measure, genetic 
diversity at the colony periphery, to parameterize our models and show that we can then 
accurately predict another key variable: the degree of short-range cell migration along an 
edge. Moreover, the model allows us to estimate other key parameters including effective 
population size (density) at the expansion frontier. While our experimental system is a
simplification of natural microbial communities, we argue it is a proof of principle that the
spatial models of population genetics can quantitatively capture organismal evolution.
Introduction



The neo-Darwinian synthesis brought population genetics into the heart of evolutionary biology
(Wright, 1932; Fisher, 1930; Dobzhansky, 1937), something that has since been crystallized in the
vast array of sophisticated models, which incorporate factors like stochasticity, spatial structure,
and epistasis (Crow and Kimura, 1970; Felsenstein, 1976; Loewe and Hill, 2010). Despite this
and the widespread acceptance of population genetics as the cornerstone of evolutionary
theory, the low ratio of empiricism to modeling has long been noted in the literature
(Mayr, 1959; Rousset, 2004). Although genomic studies can be very useful for testing qualitative
predictions of population genetics, e.g. the importance of effective population size for purifying
selection (Lynch, 2002), a controlled experiment in the laboratory is better suited for the specific
goal of validating the theory.

One of the key barriers to quantitative tests is that many theories of population genetics 
consider well-mixed populations, i.e. populations without spatial structure. The assumption 
of a lack of structure can often be a drastic approximation. Moreover, the ever-growing
number of theories that do incorporate spatial structure (Excoffier et al., 2009; Rousset, 2004;
Nagylaki and Lou, 2008) presents a new set of problems because empirical tests require the
knowledge of certain demographic parameters, such as dispersal rates, for which little data
are typically available (Rousset, 2004). An open challenge for population genetics, therefore,
is to develop new empirical methods and tests of the theories themselves, and, in particular, 
quantitative tests that can rigorously verify models of spatially structured populations. 

We focus on neutral population genetics theory of effectively one-dimensional populations
applied to range expansions. The importance of spatial separation for the development of
genetic diversity was first emphasized by Wright (1943) and was further developed by Malecot
(1955, 1975); see also (Kimura and Weiss, 1964; Nagylaki, 1974; Barton et al., 2002). In
addition to stationary populations, Malecot's analysis also describes populations undergoing
range expansions, as was shown in (Korolev et al., 2010). While the theory is in principle
general to a range expansion of any species, we focus here on one of the most tractable of
experimental systems, the colony biofilms formed by bacteria on solid surfaces (Rani et al.,
2004).



Microbial populations are often used to elucidate the mechanisms of evolution because of
their short generation times and the ease of genetic manipulation (Elena and Lenski, 2003;
Barrett et al., 2005; West et al., 2007). With a few exceptions (Rainey and Travisano, 1977;
Kerr et al., 2006; Rainey and Rainey, 2003), the homogeneous environment of a chemostat or
a vigorously shaken test tube is preferred because of its simplicity and the ease of comparison 
to mathematical models, which are often intractable for populations with spatial structure. 
However, even for microbes, it is now clear that most natural populations are not well-mixed, 
and the neighborhood of any individual is enriched in the individuals that are ancestrally 
related to it. This is especially true for many bacteria commonly forming dense surface-attached
communities known as biofilms (Kolter and Greenberg, 2006; Nadell et al., 2009). These resilient
microbial communities are found in numerous contexts, both medical and industrial, and are
often highly resistant to antibiotics. For this reason, there is now a large body of literature on
biofilm formation, which underlines their complexity, with major differences in the regulation
and development of biofilms among strains and species (Beloin and Ghigo, 2005). Key factors
affecting the structure of biofilms include physical interactions among cells, the diffusivity of
metabolites (Dockery and Klapper, 2001; Stewart, 2003), and the genetic regulation of growth
rate in response to both nutrient levels (Vulic and Kolter, 2001) and quorum sensing
(Parsek and Greenberg, 1999; Vulic and Kolter, 2001; Hammer and Bassler, 2003; Nadell et al., 2008).
Growth in microbial groups also often relies on growth-promoting secretions, which themselves
can be under nutrient and quorum sensing regulation (Xavier et al., 2011).

Despite the potential complexities, the spatio-genetic patterns in microbial biofilms can be
visualized by fluorescently labeling part of the population (Klausen et al., 2003; Hallatschek et al.,
2007). With this setup, cells of the same color can be ancestrally related, while cells of different
colors cannot (provided mutations that switch colors can be neglected). Colony biofilms are
particularly amenable to such visualizations (Rani et al., 2007). As an example, the fluorescent
image of a bacterial colony is shown in Fig. 1A. The most striking feature of this image
is that an initially well-mixed population of two alleles spatially separates or demixes into 
monoallelic sectors. The sectoring pattern in this colony is due to genetic drift at the front 
of the expansion, i.e. stochastic changes in the allele frequencies due to random variation in 
the number of offspring keeping up with the outgoing expansion. Since at any given time 
only a few cells at the edge of the expansion colonize the Petri dish, the fluctuations can be 
large and lead to random local increase in the number of cells of one color at the expense 
of the other. Once one of the colors becomes extinct as a result of this stochastic process 
in a small region of the expanding colony, a sector is formed, and it persists as long as its 
boundaries do not coalesce. Such spatio-genetic correlations significantly affect the evolutionary
dynamics of populations (Korolev et al., 2010). General examples range from the neutral
theory in ecology (Houchmandzadeh and Vallade, 2003) and evolution (Kimura and Weiss,
1964; Charlesworth et al., 2003), through the propensity for malignant tumor formation
(Michor et al., 2003), to the evolution of cooperation (Nowak and May, 1992; Nowak et al., 1994;
Hauert and Doebeli, 2004). Indeed, the latter has direct relevance to microbes and the evolution
of growth-promoting secretions that are cooperative in the sense that the secretion of one cell
will help others grow. These secretions often require strong spatio-genetic structure that prevents
exploitative genotypes from using the secretions of the cooperative genotypes
(Nadell et al., 2010, 2009).

Here, we use the spatio-genetic patterns in bacterial colonies to test the population genetics 
models of range expansions. We begin by synthesizing and extending the theory of range 
expansions in a way that allows us to compare the predictions of this theory to our experimental 
data. In particular, two aspects of the theory are novel. First, we find that chiral or non-neutral
growth results in sector boundaries being logarithmic spirals [Eq. (9)]. Second, we present a
detailed discussion of the role of population density in setting the number of surviving sectors. This
theory of range expansions is tested using experiments with two ecologically distinct bacterial
species: the gut microbe Escherichia coli and the opportunistic pathogen Pseudomonas aeruginosa.
Specifically, we test the dependence of the genetic diversity at the population front (sector
number) on the physical size of the founding population (initial radius of the colony) and check 
the assumption that boundaries between genotypes perform radial random walks. We show 
that the model not only captures the qualitative relationship between demographic parameters 
and spatio-genetic patterns, but it also allows us to estimate these parameters with considerable 
accuracy. This quantitative validation of the theory supports the utility of population genetics 
models to capture the evolutionary process. 
Methods



Data deposited at Dryad: doi:10.5061/dryad.3147q. 



Bacterial Strains and Plasmids 



We used the same two strains of E. coli as in (Hallatschek et al., 2007), where it was shown that
these strains are equally fit in well-mixed liquid culture and genetically identical except for the
different fluorescent protein gene carried on plasmids. The P. aeruginosa strain was generously
provided by Roberto Kolter and was originally isolated from fresh water in Columbia. We
labeled this strain with cfp and yfp (Xavier et al., 2009), resulting in two strains, which are also
genetically identical except for the fluorescence gene inserted in the chromosome. The pairs
of colored strains within each species grew at the same rate and behaved neutrally on the plates.
This was confirmed from the shape of the sector boundaries in the experiments
(Hallatschek and Nelson, 2010). In particular, sectors of a strain with a fitness advantage would
tend to curve outward and cut off neighboring strains; see Eq. (9). This boundary bending was not
observed, and the analysis of sector boundaries showed that, both for E. coli and P. aeruginosa,
the pairs of colored strains had the same fitness in our experiments; see Results.



Growth Conditions 

Each strain was first grown separately in 3 mL of liquid LB medium supplemented with
the appropriate antibiotic (carbenicillin for E. coli and gentamycin for P. aeruginosa) at the
concentration of 100 µg/mL. The test tubes were incubated at 37°C and constantly shaken.
After an overnight growth, bacteria were diluted to an optical density of 0.05 (at 600 nm
wavelength) in 3 mL of liquid LB medium without antibiotics and grown for 2 hours. Then,
the two strains of interest were mixed in a 1:1 ratio, as measured by optical density, and a small
volume of bacterial mix ranging from 1 µL to 64 µL was pipetted on an LB-agar plate with 1.5%
agar and 100 µg/mL of the appropriate antibiotic. The plates were then incubated at 21°C or 37°C
and high humidity. E. coli strains did not grow at 21°C.



Microscopy and Image Processing 

Fluorescent images were obtained with a fluorescence stereoscope Zeiss Lumar V.12 and a 
scanner Amersham 392 Typhoon 9400 (GE Healthcare). Scanned plates were imaged from the 
bottom at 50 µm resolution with Alexa 526 filter.

The initial radii of the colonies were measured within an hour of inoculation by fitting a
circle to the colony boundary using the fluorescent microscope software. The number of sectors
in each colony was counted manually. MatLab R2007 was used to extract the radii of the
colonies at later times and the shape of sector boundaries from the images. In particular, we
used the routine edge to identify sector boundaries, and a least square fitting routine to find
the circumference of colonies (Pratt, 1987).
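As an illustration of the circle-fitting step, the following is a minimal sketch that fits a circle to edge points with a simple algebraic least-squares method. It is an assumption made for illustration only: it is not the routine cited above, and the function name `fit_circle` and the synthetic edge points are hypothetical.

```python
import numpy as np

def fit_circle(x, y):
    """Algebraic least-squares circle fit.

    Solves x^2 + y^2 = 2*a*x + 2*b*y + c for (a, b, c) in the least-squares
    sense; the center is (a, b) and the radius is sqrt(c + a^2 + b^2).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    A = np.column_stack([2.0 * x, 2.0 * y, np.ones_like(x)])
    rhs = x**2 + y**2
    (a, b, c), *_ = np.linalg.lstsq(A, rhs, rcond=None)
    radius = np.sqrt(c + a**2 + b**2)
    return a, b, radius

# Synthetic colony edge (hypothetical): noisy points on a circle of
# radius 3 centered at (1, -2).
rng = np.random.default_rng(0)
theta = rng.uniform(0.0, 2.0 * np.pi, 200)
xs = 1.0 + 3.0 * np.cos(theta) + rng.normal(0.0, 0.01, theta.size)
ys = -2.0 + 3.0 * np.sin(theta) + rng.normal(0.0, 0.01, theta.size)
cx, cy, r = fit_circle(xs, ys)
```

The algebraic form keeps the problem linear in the unknowns, so a single call to a linear least-squares solver suffices; more careful fits weight the points differently.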



To calculate the heterozygosity $H(t, \phi)$ defined by Eq. (3) and shown in Fig. 2, we obtained
the allele frequencies $f(t, \phi)$ along a set of concentric circles (which stand for different time
points). While the frequencies are known in simulations, in experiments we had to calculate
$f(t, \phi)$ from the fluorescence intensities of the colony images. This was done by choosing
a threshold intensity such that any pixel with higher intensity is assigned $f = 1$ and any pixel
with lower intensity is assigned $f = 0$. We then used Eq. (3) to calculate the heterozygosity,
and the averaging was done over all pairs of points an angle $\phi$ apart on a given circle and over
all the circles corresponding to the same time point in the colonies used in that experiment.
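The thresholding and averaging can be sketched as follows for a single circle. This is a minimal illustration under assumptions: the intensity profile is taken to be sampled at equally spaced angles, and the function name, the threshold value, and the toy profile are all hypothetical.

```python
import numpy as np

def heterozygosity_on_circle(intensity, threshold):
    """Return H at angular lags k*(2*pi/n), k = 0..n-1, for one circle.

    `intensity` holds fluorescence values sampled at n equally spaced
    angles along a circle of fixed radius (one time point).
    """
    f = (np.asarray(intensity) > threshold).astype(float)  # f = 1 or 0 per pixel
    n = f.size
    H = np.empty(n)
    for k in range(n):
        g = np.roll(f, -k)  # allele frequency at angle phi + k*(2*pi/n)
        H[k] = np.mean(f * (1.0 - g) + g * (1.0 - f))
    return H

# Toy profile (hypothetical): two equal sectors, bright on one half of the circle.
profile = np.array([10.0] * 50 + [1.0] * 50)
H = heterozygosity_on_circle(profile, threshold=5.0)
```

For several circles at the same time point, or several colonies, the per-circle arrays would be averaged lag by lag.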



Estimating $D_s$ and $D_g$

Here we describe how $D_s$ and $D_g$ were estimated from fluorescent images of genetic demixing.
The diffusion constant ($D_s$ in Table 1) was estimated from the random walks of sector boundaries.
We first obtained the polar coordinates $\phi(r)$ of individual sector boundaries as described above.
Once the coordinates were known for $r \in (r_i, r_f)$, we evaluated $\phi(r) - \phi(r_i)$ for each boundary.
We then estimated the mean and variance of this quantity as a function of $\ln(r/r_i)$ and $1/r_i - 1/r$
respectively. The model accurately described boundary motion, and the variance of $\phi(r) - \phi(r_i)$
was a linear function of $1/r_i - 1/r$; therefore, $2D_s/v_\parallel$ was found from the least mean square
estimate of the slope.
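The slope estimate can be sketched as below, assuming the boundary angles have already been extracted on a common grid of radii. The function name and the synthetic random walks (generated with a known slope of 0.02 in arbitrary length units) are hypothetical and serve only to check that the regression recovers the input.

```python
import numpy as np

def estimate_two_Ds_over_v(r, phi):
    """Estimate 2*D_s/v_par from sector-boundary trajectories.

    r   : radii, shape (n_r,), increasing, with r[0] = r_i
    phi : boundary angles in radians, shape (n_boundaries, n_r)
    """
    r = np.asarray(r, dtype=float)
    phi = np.asarray(phi, dtype=float)
    dphi = phi - phi[:, :1]           # phi(r) - phi(r_i) for each boundary
    x = 1.0 / r[0] - 1.0 / r          # abscissa: 1/r_i - 1/r
    var = dphi.var(axis=0)            # variance across boundaries at each r
    slope, _intercept = np.polyfit(x, var, 1)
    return slope

# Synthetic boundaries (hypothetical): independent increments whose variance
# is true_slope times the increment of 1/r_i - 1/r.
rng = np.random.default_rng(1)
r = np.linspace(1.0, 10.0, 200)
true_slope = 0.02
x = 1.0 / r[0] - 1.0 / r
steps = rng.normal(0.0, np.sqrt(true_slope * np.diff(x)), size=(2000, r.size - 1))
phi = np.concatenate([np.zeros((2000, 1)), np.cumsum(steps, axis=1)], axis=1)
est = estimate_two_Ds_over_v(r, phi)
```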

In addition, both $D_s$ and $D_g$ (see Table 1) were estimated from the experiments done
with different volumes of inoculant. Different volumes of inoculant led to different initial
radii of the colonies, which were measured as described above. The observed values of radii
were then binned, and the average number of sectors at the end of the experiment (when
coalescence of sectors ceased) was calculated within each bin. The average number of sectors
was a linear function of the square root of the initial radius. The least mean square estimate
equals $H_0\sqrt{2\pi v_\parallel/D_s}$ for the slope and $2\pi H_0 v_\parallel/D_g$ for the intercept; see Eq. (7).
Since both $H_0$ and $v_\parallel$ were measured directly, these least mean square estimates allowed us
to measure $D_s$ and $D_g$.
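A minimal sketch of this inversion is given below. It assumes the slope and intercept have the forms $H_0\sqrt{2\pi v_\parallel/D_s}$ and $2\pi H_0 v_\parallel/D_g$; the function name and all numerical values are hypothetical and are not measured data.

```python
import numpy as np

def estimate_Ds_Dg(R0, n_sectors, H0, v_par):
    """Fit N = intercept + slope*sqrt(R0) and invert for D_s and D_g.

    Assumed forms: slope     = H0 * sqrt(2*pi*v_par / D_s)
                   intercept = 2*pi*H0*v_par / D_g
    """
    slope, intercept = np.polyfit(np.sqrt(R0), n_sectors, 1)
    D_s = 2.0 * np.pi * v_par * (H0 / slope) ** 2
    D_g = 2.0 * np.pi * H0 * v_par / intercept
    return D_s, D_g

# Round-trip check with made-up parameters (not experimental values).
H0, v_par, Ds_true, Dg_true = 0.5, 1.0, 0.1, 2.0
R0 = np.array([1.0, 2.0, 4.0, 8.0, 16.0])
N = 2.0 * np.pi * H0 * v_par / Dg_true + H0 * np.sqrt(2.0 * np.pi * R0 * v_par / Ds_true)
Ds_est, Dg_est = estimate_Ds_Dg(R0, N, H0, v_par)
```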



On-Lattice Simulations 

We also tested our genetic demixing model described below via simulations designed to mimic
range expansions of natural populations. In these simulations, the habitat was a square lattice
of islands that could each support at most a fixed number of individuals, similar to that in the
stepping-stone model (Kimura and Weiss, 1964). During every time step, each island had one
opportunity for outward migration and one opportunity for growth.

During the migration update, an individual from an island could migrate to a randomly 
selected nearest neighbor island with a probability proportional to the fraction of cells on the 
island and the fraction of empty spots (vacancies) on that neighbor island. The coefficient 
of proportionality d < 1 determined the magnitude of the effective diffusion constant, and 
was used to vary the rate of migration. Since the migration rate was proportional to the 
fraction of vacancies, an organism could exchange positions only with a vacancy, not another 
organism. Hence, there was no migration in the dense-packed interior of the simulated colony. 
Its spatio-genetic pattern was thus a frozen record of the genetic composition of the expansion
front, similar to (Hallatschek et al., 2007). Hence, we analyzed both the experimental data
and the simulation data on an equal footing. Periodic boundary conditions were used on all 
four sides of the two-dimensional habitat. 

During the growth update, a randomly selected individual in each island reproduced with a 
probability proportional to the fraction of the cells on the island and the fraction of vacancies 
on that island; the coefficient of proportionality p < 1 allowed us to vary the growth rate. 
We checked that this update rule could reproduce simple logistic growth (Murray, 2003). To
compare with the experiments, we simulated only two alleles and tracked the genetic identity
of the individuals throughout the simulation. A typical simulated range expansion is shown in
Fig. 1B.
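The migration and growth updates described above can be sketched as follows. This is a minimal illustration, not the simulation code used for the figures: the raster-order sweep over islands, the square homeland, the lattice size, and the values of `N`, `d`, and `p` are all assumptions chosen for brevity.

```python
import random

def step(n1, n2, N, d, p, rng):
    """One time step: each island gets one migration and one growth attempt."""
    L = len(n1)
    for x in range(L):
        for y in range(L):
            # Migration update: move one individual to a random nearest neighbor
            # with probability d * (fraction occupied here) * (fraction vacant there).
            n = n1[x][y] + n2[x][y]
            dx, dy = rng.choice([(1, 0), (-1, 0), (0, 1), (0, -1)])
            u, v = (x + dx) % L, (y + dy) % L      # periodic boundaries
            vacancies = N - n1[u][v] - n2[u][v]
            if rng.random() < d * (n / N) * (vacancies / N):
                if rng.random() < n1[x][y] / n:    # pick the migrant at random
                    n1[x][y] -= 1
                    n1[u][v] += 1
                else:
                    n2[x][y] -= 1
                    n2[u][v] += 1
            # Growth update: a random individual reproduces into a local vacancy
            # with probability p * (fraction occupied) * (fraction vacant).
            n = n1[x][y] + n2[x][y]
            if n > 0 and rng.random() < p * (n / N) * ((N - n) / N):
                if rng.random() < n1[x][y] / n:
                    n1[x][y] += 1
                else:
                    n2[x][y] += 1

def run(L=30, N=10, d=0.5, p=0.5, steps=100, seed=0):
    """Start from a full, well-mixed 6x6 homeland and expand for `steps` steps."""
    rng = random.Random(seed)
    n1 = [[0] * L for _ in range(L)]
    n2 = [[0] * L for _ in range(L)]
    for x in range(L // 2 - 3, L // 2 + 3):
        for y in range(L // 2 - 3, L // 2 + 3):
            n1[x][y] = sum(rng.random() < 0.5 for _ in range(N))
            n2[x][y] = N - n1[x][y]
    for _ in range(steps):
        step(n1, n2, N, d, p, rng)
    return n1, n2

n1, n2 = run()
total = sum(map(sum, n1)) + sum(map(sum, n2))
```

Because both update probabilities carry the fraction of vacancies, full islands neither receive migrants nor grow, which is what freezes the pattern behind the front.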



Off-Lattice Simulations

We also carried out simulations using an off-lattice, agent-based approach that explicitly 
described the dynamics of each agent based on the micro-environment it perceived. The agents 
represented cells and their behavior was governed by the rules that mimic the behavior of 
real bacteria: they increased in radius when taking up nutrients, divided once they reached a 
certain critical size, and moved continuously (off-lattice) when pushed by neighboring cells. The 
pushing algorithm was based on a hard-sphere model where each agent moved in the direction 
minimizing the overlap with neighboring agents. The model used a fast numerical method (FAS
multigrid) to solve partial differential equations for the nutrient concentration. The algorithms
were described in a previous publication (Xavier et al., 2005).
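To give a flavor of overlap-minimizing movement, the following is a minimal two-dimensional sketch in which overlapping disks are pushed apart along the line joining their centers. It is an assumption made for illustration and is not the algorithm of the cited publication; the function name and the pairwise half-overlap rule are hypothetical.

```python
import numpy as np

def relax_overlaps(pos, radius, n_iter=200):
    """Push overlapping disks apart until overlaps (approximately) vanish."""
    pos = np.array(pos, dtype=float)
    radius = np.asarray(radius, dtype=float)
    for _ in range(n_iter):
        shift = np.zeros_like(pos)
        for i in range(len(pos)):
            for j in range(i + 1, len(pos)):
                delta = pos[j] - pos[i]
                dist = np.hypot(delta[0], delta[1])
                overlap = radius[i] + radius[j] - dist
                if overlap > 0 and dist > 0:
                    # Each disk moves half the overlap, in opposite directions.
                    push = 0.5 * overlap * delta / dist
                    shift[i] -= push
                    shift[j] += push
        if not shift.any():
            break
        pos += shift
    return pos

# Two unit disks whose centers start one unit apart.
relaxed = relax_overlaps([[0.0, 0.0], [1.0, 0.0]], [1.0, 1.0])
separation = np.hypot(*(relaxed[1] - relaxed[0]))
```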



We simulated neutral evolution experiments by implementing two genotypes with identical
phenotypes except for their color. Agent growth was determined by the local concentration of
the growth-limiting nutrient following Monod kinetics; see (Nadell et al., 2010). Simulations
were initiated with a number of individuals placed in a circular homeland. The growth of the
agents and colony was then followed in time, producing structures such as the one shown in
Fig. 1C.
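For reference, Monod kinetics makes the specific growth rate a saturating function of the nutrient concentration. The short sketch below uses the standard form; the function name and the parameter values are hypothetical and chosen only to show the saturating shape.

```python
def monod_growth_rate(S, mu_max, K_s):
    """Specific growth rate under Monod kinetics: mu_max * S / (K_s + S)."""
    return mu_max * S / (K_s + S)

# Hypothetical parameter values: growth is zero without nutrient, half-maximal
# at S = K_s, and approaches mu_max at high concentration.
rates = [monod_growth_rate(S, mu_max=1.0, K_s=0.5) for S in (0.0, 0.5, 5.0, 50.0)]
```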



Models of range expansions 

The importance of space to evolutionary processes has been studied in many disciplines including
population genetics, ecology, and probability theory, where the classic spatial models, for
example the stepping-stone model (Kimura and Weiss, 1964; Nagylaki, 1974; Malecot, 1975),
Hubbell's model (Hubbell, 2001; Houchmandzadeh and Vallade, 2003), and the voter model
(Cox and Griffeath, 1986), have been developed for stationary (non-expanding) populations.
Following (Korolev et al., 2010), we use coalescent theory (Kingman, 1982) to track the genetic
diversity at different points in space and adapt the classic results from spatial population
genetics (Malecot, 1955, 1975; Kimura and Weiss, 1964; Nagylaki, 1974; Barton et al., 2002) to
understand the evolutionary dynamics of clonal populations during a range expansion.

While neutral expansions typically occur on a two-dimensional surface, a one- dimensional 
model is sufficient to understand how genetic diversity changes at the front of the expanding 
population. For stationary (non-expanding) populations, the dimensionality of space plays an
important role (Barton et al., 2002; Korolev et al., 2010), and dimensional reduction from two
to one spatial dimension is impossible. In expanding populations, however, this dimensional
reduction is possible because all relevant dynamics occurs at the perimeter of the population, 
which is effectively one-dimensional. Indeed, the organisms far behind the expansion front 
cannot affect the future genetic composition at the frontier because their offspring are not part 
of the outgoing expansion wave. Although the dynamics of allele frequencies at the front can be
described by an effective one-dimensional model, this is not in general true for the population in
the wake of the expansion, for which a full two-dimensional model is necessary. Nevertheless,
the results of the one-dimensional model are applicable behind the front on the time scales
when migration in the interior can be neglected (Hallatschek and Nelson, 2010).

Although a one-dimensional model is easier to analyze, the demographic parameters in 
this model are in general complicated functions of the demographic parameters of the original 
two-dimensional population. For example, the effective linear density (carefully defined in 
Appendix A) of the one-dimensional population at the frontier depends on the carrying capacity 
of the habitat, the shape of the density profile at the front, and the probability of individuals
slightly behind the expansion edge to migrate forward¹ (Hallatschek and Nelson, 2008). The
exact nature of these dependences varies for different reproduction and migration models, but
the effective one-dimensional description is universal, once model-specific details are absorbed
in the effective demographic parameters. 

To make a direct connection with the subsequent experiments, we assume that only two 
genotypes are present, and that they have the same fitness. We also assume that the colony 
radius R increases linearly with time t: 



$$R(t) = R_0 + v_\parallel t, \tag{1}$$

where $v_\parallel$ is the radial expansion velocity, and $R_0$ is the radius of the colony when $t = 0$.



Expansions with a constant velocity are expected for reaction-diffusion population dynamics
(Murray, 2003) and have been observed in microbial colonies (Hallatschek et al., 2007;
Wakita et al., 1994). Range expansions in our simulations also obey Eq. (1).



Spatial genetic demixing (Fig. 1) is one of the key predictions of this model. Although the
total number of individuals is very large, locally only a finite number reproduce, leading to
genetic drift (i.e. fluctuations in the genetic composition of a population due to the random
sampling of offspring to form the next generation). Over time, these demographic fluctuations
tend to reduce genetic diversity at the growing front of the population. Eventually, a single
genotype will reach fixation at any point on the expanding edge. In bacterial colonies, this
fixation manifests as the formation of a sector (Fig. 1A).

Below we present two complementary ways to rigorously analyze the behavior of the
one-dimensional population at the expansion front. The first approach is an extension of classic
spatial population genetics (Malecot, 1955, 1975; Kimura and Weiss, 1964; Nagylaki, 1974, 1998;
Barton et al., 2002) using coalescent theory (Kingman, 1982) to track the genetic diversity at
different points in space. At the core of this approach is the stepping-stone model of Kimura and
Weiss (1964), which describes migration and reproduction of individuals in a population. A concise
description of this model is given in Appendix A. The second modeling approach was proposed
in (Hallatschek et al., 2007; Hallatschek and Nelson, 2010) and tracks the domain boundaries
between the regions with different genotypes. While it is applicable only after monoallelic
domains (sectors) form, the second approach yields predictions complementary to the predictions
of the first approach. We use these additional predictions to make a quantitative experimental
test of our theory.



¹Organisms expanding in spatially homogeneous habitats typically reach the carrying capacity in the interior
of the population, but the population density decays to zero beyond the expansion front. Here, we assume
that the density profile at the front remains constant during the expansion, resulting in a constant population
density in the effective one-dimensional model. This behavior is typical for reaction-diffusion systems with
saturation (Fisher, 1937; Kolmogorov et al., 1937; Murray, 2003) and is confirmed in our simulations; see also
(Nadell et al., 2010).
Model 1: Tracking local genetic diversity



Spatial genetic demixing can be most readily understood by calculating the genetic diversity of
the population over time. The genetic composition of the front at time $t$ is fully described by
the relative fraction (frequency) of one of the two alleles $f(t, \phi)$, where $\phi$ is the azimuthal angle
around the center of the population. The relative fraction of the other allele is then $1 - f(t, \phi)$.
These fractions change in time because of migration and reproduction. One can think of the
population as a circle of islands (demes) each carrying a certain number of organisms (this
number of organisms sets the strength of genetic drift). Each generation, the organisms
reproduce within a deme, say by a Wright-Fisher process, and migrate between neighboring
demes. This conceptual population model is the stepping-stone model of Kimura and Weiss
(1964); we present a mathematical formulation of the stepping-stone model in Appendix A.
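The conceptual model can be sketched in a few lines for a ring of fixed circumference. This is a minimal illustration under assumptions: the ring does not inflate as it would during a radial expansion, and the function name, the migration fraction `m`, the deme size, and the number of generations are hypothetical choices.

```python
import numpy as np

def stepping_stone_ring(n_demes=200, deme_size=20, m=0.1,
                        generations=300, f0=0.5, seed=0):
    """Neutral two-allele stepping-stone model on a ring of demes.

    Each generation: nearest-neighbor migration mixes allele frequencies,
    then Wright-Fisher sampling of `deme_size` offspring adds genetic drift.
    """
    rng = np.random.default_rng(seed)
    f = np.full(n_demes, f0)
    for _ in range(generations):
        # Migration: a fraction m of each deme is replaced by neighbors.
        f = (1.0 - m) * f + 0.5 * m * (np.roll(f, 1) + np.roll(f, -1))
        f = np.clip(f, 0.0, 1.0)  # guard against floating-point overshoot
        # Reproduction: binomial (Wright-Fisher) sampling within each deme.
        f = rng.binomial(deme_size, f) / deme_size
    return f

f = stepping_stone_ring()
H_local = np.mean(2.0 * f * (1.0 - f))  # heterozygosity within demes
```

Plotting `f` against deme index after many generations shows long runs of demes fixed for one allele, the one-dimensional analogue of sectors.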

Because $f(t, \phi)$ is a random variable, which is different for every expansion, we need to
calculate quantities averaged over repeated expansions to compare the predictions of the model
to experimental data. The first moment (mean) of $f(t, \phi)$ does not contain information about
genetic demixing, because the relative fraction of two neutral genotypes does not change on
average:

$$E[f(t, \phi)] = f_0, \tag{2}$$

where $f_0$ is the initial frequency of allele one, and $E$ denotes averaging with respect to different
realizations of the range expansion.

The second moment (variance), however, captures the spatio-genetic correlations. We define 
the average spatial heterozygosity as 



$$H(t, \phi_1 - \phi_2) = E\{f(t, \phi_1)[1 - f(t, \phi_2)]\} + E\{f(t, \phi_2)[1 - f(t, \phi_1)]\}, \tag{3}$$

which is the probability of sampling two different alleles separated by distance $|\phi_1 - \phi_2| R(t)$
along the circular frontier. Note that $H$ depends only on the relative separation between the
two points because of rotational invariance of the averages.

At the beginning of the expansion, $H(0, \phi) = H_0 = 2f_0(1 - f_0)$ because the individuals are
well mixed. As the expansion progresses and large monoallelic domains appear, the average
spatial heterozygosity decreases for $\phi$ smaller than the typical domain size because sampling
two different alleles in close proximity is unlikely when the population is demixed. For $\phi$ much
larger than the typical domain size, the average spatial heterozygosity is close to $H_0$ because
the two sampling points belong to different and uncorrelated sectors carrying allele one with
probability $f_0$ or allele two with probability $1 - f_0$. Hence, the spatial scale over which $H(t, \phi)$
increases from its lowest value to $H_0$ characterizes the typical domain size. (See Fig. 2 and
the discussion below.)

As in well-mixed populations, the depression of $H(t, \phi)$ for small $\phi$ can be understood by
working backward in time and calculating the probability of two individuals to have a common 
ancestor. Apart from t, this probability depends on the spatial separation of the individuals, 
the extent of lateral movement during the expansion, and the density of the population. Indeed, 
for their ancestral lineages to coalesce (converge on the common ancestor), the spatial random 
walks performed by the lineages must move to the same spatial location, and the lineages must 
originate from the same individual in that location. Therefore, the forward-in-time equation for 
the dynamics of $H(t, \phi)$ must have a diffusion term to account for the lateral random walk of
the ancestral lineages and a sink term to account for the loss of genetic diversity due to lineage
coalescence. This equation is rigorously derived in (Korolev et al., 2010) (see also Appendix A)
and is given below:


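$$\frac{\partial}{\partial t} H(t, \phi) = \frac{2 D_s}{(R_0 + v_\parallel t)^2}\,\frac{\partial^2}{\partial \phi^2} H(t, \phi) - \frac{D_g}{R_0 + v_\parallel t}\, H(t, \phi)\,\delta(\phi), \tag{4}$$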



where $\delta(\phi)$ is Dirac's delta function, $D_s$ is the spatial diffusion constant of an ancestral lineage,
and $D_g\delta(x)$ is the coalescence rate of two ancestral lineages separated by distance $x$ along
the front. This rate and, therefore, $D_g$ are inversely proportional to the generation time
and decrease with the population density at the front because the probability of two nearby
organisms to be siblings is inversely proportional to the effective population density
(Ewens, 1982), defined in Appendix A. In other words, $D_g$ is a phenomenological parameter
characterizing the strength of genetic drift. Although one may naively expect that $D_g$ is inversely
proportional to the actual population density (or carrying capacity), this is often not the case
(Hallatschek and Nelson, 2008). The factors of $R_0 + v_\parallel t$ are necessary to relate the angular
coordinate $\phi$ to distances along the constantly inflating circumference of the population.
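The origin of these factors can be made explicit with a short, heuristic change of variables. The sketch assumes that, at a fixed time $t$, lineages diffuse and coalesce along the arc length $x$ of the front as in a linear habitat, with diffusion term $2D_s\,\partial^2 H/\partial x^2$ and sink term $D_g H \delta(x)$, and it uses the scaling property $\delta(ax) = \delta(x)/|a|$:

```latex
\begin{align*}
x &= (R_0 + v_\parallel t)\,\phi, \\
2 D_s \frac{\partial^2 H}{\partial x^2}
  &= \frac{2 D_s}{(R_0 + v_\parallel t)^2}\,\frac{\partial^2 H}{\partial \phi^2}, \\
D_g\, H\, \delta(x)
  &= D_g\, H\, \delta\bigl((R_0 + v_\parallel t)\,\phi\bigr)
   = \frac{D_g}{R_0 + v_\parallel t}\, H\, \delta(\phi).
\end{align*}
```

The diffusion term therefore weakens as the inverse square of the radius, while the coalescence term weakens only as the inverse of the radius.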

From the forward-in-time perspective, the first term on the right-hand side of Eq. (4)
represents the diffusive mixing in the population, and the second term is responsible for the
loss of genetic diversity due to sampling. Smaller population densities lead to more severe
bottlenecks, faster reduction in local genetic diversity, and, therefore, larger values of $D_g$.

The exact solution of Eq. (4) is given in (Korolev et al., 2010) and is plotted in Fig. 2
for a particular choice of the parameters. The coalescence term in Eq. (4) acts as a sink and
is responsible for the development of the dip in $H(t, \phi)$ around $\phi = 0$. This decrease of the
probability to find two different alleles at the same spatial location [given by $H(t, 0)$] tracks
local fixation and the formation of sectors.

There are two important time scales for the genetic demixing process in the model. The 
first time scale characterizes the interplay of migration and genetic drift and is the time over 
which genetic diversity is lost locally (monoallelic domains form) when radial growth is slow.
As was shown in (Korolev et al., 2010), this time scale is given by

$$\tau_s = D_s / D_g^2. \tag{5}$$

The other time scale characterizes the rate of the spatial expansion and is the time it takes the
colony to double its initial radius:

$$\tau_d = R_0 / v_\parallel. \tag{6}$$

As we shall show below, the ordering of these time scales ($\tau_s > \tau_d$ vs. $\tau_s < \tau_d$) determines
whether the domain formation is sensitive to the constant increase in the front circumference.
For long times $t \gg \tau_d$ and $t \gg \tau_s$, the average spatial heterozygosity $H(t, \phi)$ reaches a
steady state; see Fig. 2A and (Korolev et al., 2010). At this point, the diffusive motion of
the sector boundaries is negligible compared to the deterministic increase in separation among
sector boundaries due to colony expansion; therefore, the sector boundaries stop coalescing,
and their relative angular positions remain largely unchanged. The number of sectors in this
asymptotic state is calculated in (Korolev et al., 2010), and is given by



$$N(R_0) = \frac{2\pi H_0 v_\parallel}{D_g} + H_0\sqrt{\frac{2\pi R_0 v_\parallel}{D_s}}. \tag{7}$$



The two terms on the right hand side of Eq. (7) reflect the dominant contributions of the
two possible regimes for different orderings of $\tau_d$ and $\tau_s$. The second term on the right
hand side of Eq. (7) gives the dominant contribution to $\mathcal{N}(R_0)$ when sectors form before the
diffusive motion of sector boundaries becomes weaker than their deterministic separation due
to colony expansion. This condition is equivalent to $\tau_d \gg \tau_s$, i.e. the sectors form before
the colony expands significantly (footnote 1). This term was first calculated in (Hallatschek et al., 2007;
Hallatschek and Nelson, 2010) by neglecting the initial stage of genetic demixing when distinct
sectors appear. The first term on the right hand side of Eq. (7) is an additional contribution
to $\mathcal{N}(R_0)$ from the early stage of genetic demixing and gives the dominant contribution in the
opposite limit $\tau_s \gg \tau_d$ when sectors do not coalesce after formation.

Equation (7) allows us to estimate $D_s$ and $D_g$ (see Methods), and forms the basis of the
quantitative test of spatial population genetics discussed below.
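Since the predicted sector number is a constant plus a term proportional to $\sqrt{R_0}$, the estimation step reduces to a straight-line fit in the variable $\sqrt{R_0}$. The following is a minimal sketch of that fit only: the function name and the synthetic data are hypothetical, and the conversion of the intercept and slope into $D_g/v_\parallel$ and $D_s/v_\parallel$ requires the prefactors of Eq. (7), which are deliberately not reproduced here.

```python
import numpy as np


def fit_sector_counts(R0, n_sectors):
    """Least-squares line n_sectors = intercept + slope * sqrt(R0)."""
    x = np.sqrt(np.asarray(R0, dtype=float))
    y = np.asarray(n_sectors, dtype=float)
    slope, intercept = np.polyfit(x, y, 1)
    return intercept, slope


# Synthetic, made-up data: a straight line in sqrt(R0) plus small noise.
rng = np.random.default_rng(0)
R0 = np.array([500.0, 1000.0, 2000.0, 4000.0, 8000.0])  # hypothetical radii
counts = 2.0 + 0.5 * np.sqrt(R0) + rng.normal(0.0, 0.05, R0.size)

intercept, slope = fit_sector_counts(R0, counts)
print(f"intercept = {intercept:.2f}, slope = {slope:.3f}")
```

An intercept statistically indistinguishable from zero means the first term of Eq. (7) cannot be resolved from the data.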



Model 2: Tracking domain boundaries 

Although Eq. (4) explains spatial genetic demixing during range expansions, it does not directly
describe the dynamics of the domain boundaries, one of the most prominent features of the
spatio-genetic patterns shown in Fig. 1. We now introduce a second modeling approach to
understand genetic demixing that ignores the dynamics of individual organisms and instead
focuses on a larger spatial scale of domain boundaries among monoallelic regions. This forward-in-time
approach is simpler than, and complementary to, the coalescent analysis outlined in the previous
section.

After the sectors form, the dynamics of the population can be described in terms of sector
boundaries. Like ancestral lineages, sector boundaries diffuse and coalesce upon meeting. When
domain boundaries meet, the number of domains decreases, and the average sector size increases,
further increasing the average separation between the alleles. Random walks that disappear
upon meeting each other (so-called annihilating random walks) have been extensively studied in
linear geometries (Odor, 2004; Bramson and Lebowitz, 1991; Masser and ben-Avraham, 2000).
In the circular geometry of interest to us here, a phenomenological description of domain
boundary motion consistent with Eq. (4) is given in (Hallatschek et al., 2007; Hallatschek and Nelson,
2010) and is summarized by the following stochastic differential equation:



$$ d\varphi(r) = \frac{v_\perp}{v_\parallel r}\, dr + \sqrt{\frac{2 D_s}{v_\parallel r^2}}\, dB(r), \qquad (8) $$

where $\varphi(r)$ is the trajectory of a sector boundary in polar coordinates, $B(r)$ is a Wiener process,
and $v_\perp$ is the velocity of the boundary perpendicular to the direction of the expansion. As in
Eq. (4), the factors of $r$ are needed to switch from Cartesian to polar coordinates. Note that $D_s$,
the diffusion constant of the boundaries in Eq. (8), is exactly the same as the diffusion constant
of ancestral lineages in Eq. (4). The equality of diffusion constants can be established by
comparing the average domain sizes calculated using model 2 and model 1 in the limit $D_g \to \infty$.
This equality has also been derived in (Hallatschek and Korolev, 2009) for the one-dimensional
stepping-stone model, which describes range expansions with linear fronts.

Footnote 1: A sector of angle $\Phi$ increases in size as $\Phi R(t)$, so the change of the sector size due to the colony growth
in time $\Delta t$ is $\Phi \Delta t\, v_\parallel$. The change of this sector size due to the diffusive motion of the boundaries is on the
order of $\sqrt{D_s \Delta t}$. These two changes are equal for $\Delta t^* \sim D_s/(\Phi^2 v_\parallel^2)$. The typical size of sectors when they first
appear is about $\sqrt{D_s \tau_s} = D_s/D_g$; therefore $\Delta t^* \sim R_0^2 D_g^2 v_\parallel^{-2} D_s^{-1} = \tau_d^2/\tau_s$. Thus, for $\tau_s \ll \tau_d$, $\Delta t^* \gg \tau_s$,
which implies that sector boundaries appear much earlier than the time when the deterministic expansion moves
them far apart. Consequently, sector boundaries have a significant chance to coalesce, eliminating some of the
sectors.
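The stochastic differential equation (8) can be explored numerically. The sketch below integrates it with a simple Euler-Maruyama scheme in the radial variable and compares the endpoint mean and variance with the closed forms that follow from Eq. (8), namely $(v_\perp/v_\parallel)\ln(r/r_i)$ and $(2D_s/v_\parallel)(1/r_i - 1/r)$. The function name, the parameter values, and the choice of evaluating coefficients at the midpoint of each radial step are assumptions made for illustration; boundary annihilation is not modeled.

```python
import numpy as np


def simulate_boundaries(r_i, r_f, chirality, ds_over_v,
                        n_walks=4000, n_steps=400, seed=0):
    """Integrate Eq. (8) for independent boundaries starting at angle 0.

    chirality = v_perp / v_par (dimensionless)
    ds_over_v = D_s / v_par (same length units as r)
    Returns radii of shape (n_steps + 1,) and angles of shape
    (n_walks, n_steps + 1).
    """
    rng = np.random.default_rng(seed)
    r = np.linspace(r_i, r_f, n_steps + 1)
    dr = np.diff(r)
    r_mid = 0.5 * (r[:-1] + r[1:])
    drift = chirality * dr / r_mid                    # deterministic twist
    noise_sd = np.sqrt(2.0 * ds_over_v * dr) / r_mid  # diffusive wandering
    steps = drift + noise_sd * rng.standard_normal((n_walks, n_steps))
    phi = np.concatenate([np.zeros((n_walks, 1)),
                          np.cumsum(steps, axis=1)], axis=1)
    return r, phi


# Hypothetical parameters: lengths in micrometers.
r, phi = simulate_boundaries(r_i=1000.0, r_f=5000.0,
                             chirality=0.3, ds_over_v=40.0)
mean_theory = 0.3 * np.log(r[-1] / r[0])
var_theory = 2.0 * 40.0 * (1.0 / r[0] - 1.0 / r[-1])
print(f"mean: simulated {phi[:, -1].mean():.3f}, theory {mean_theory:.3f}")
print(f"variance: simulated {phi[:, -1].var():.4f}, theory {var_theory:.4f}")
```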

In the model of (Hallatschek et al., 2007; Hallatschek and Nelson, 2010), the perpendicular
velocity $v_\perp$ accounts for the growth rate difference between the organisms on the opposite
sides of the boundary. For neutral genetic demixing, $v_\perp$ equals zero, unless the colony growth is
chiral. By chiral growth we mean that sector boundaries and ancestral lineages twist exclusively
clockwise or exclusively counterclockwise. This should be distinguished from sector boundary
bending due to unequal fitnesses of the strains, which causes sector boundaries around the fitter
strain to bend in opposite directions.

In fact, we found that E. coli colonies have a noticeable chirality (footnote 2), see Fig. 5A. Indications
of weak chirality for the same strains were reported in (Hallatschek et al., 2007); however,
chirality is more pronounced under our growth conditions. From Eq. (8), the average twisting
of the sector boundary should obey the following equation

$$ E[\varphi(r)] = \varphi(r_i) + \frac{v_\perp}{v_\parallel} \ln\frac{r}{r_i}, \qquad (9) $$



where the radius $r_i$ and angle $\varphi(r_i)$ define the initial point from which we follow the boundary.
(The subscript $i$ in $r_i$ refers to "initial" and is not used for indexing purposes.) The initial
radius is larger than $R_0$ because the boundaries are clearly visible only after the initial stage
of genetic demixing. The perpendicular velocity $v_\perp$ can then be estimated from the slope
of $\varphi(r)$ vs. $\ln(r/r_i)$. Once the chiral parameter is known, we can subtract the deterministic
part from the motion of the boundary and focus on the diffusive part described by Eq. (8)
with $v_\perp = 0$, as we shall do in the rest of this paper. We shall further assume that $\varphi(r_i) = 0$
since we can always rotate our reference frame by an arbitrary angle. Note that Eq. (4) remains
valid for colonies with chiral growth like the one shown in Fig. 5A because the average spatial
heterozygosity, defined as the probability of two sampled individuals to carry different alleles,
depends on the separation between the two sampled individuals (and not their absolute angular
positions), which is unaffected by the deterministic twisting of the ancestral lineages. We note
in passing that the average sector shape given by Eq. (9) defines an equiangular logarithmic
spiral (Huntley, 1970), with the angle determined by the dimensionless ratio $v_\perp/v_\parallel$.

Footnote 2: Macroscopic chiral patterns are often related to microscopic symmetry breaking. After this paper was
submitted for publication, we learned of work by Wang et al. (2011) which reveals a left-handed twisting
motion of the E. coli cell wall by tracking various markers. Given the approximate statistical bias observed
by Hallatschek et al. (2007) for the first few close-packed rows of actively growing E. coli cells to be oriented
tangent to the border of an actively growing frontier, the screw-like growth observed by Wang et al. on the
micron scale might explain the macroscopic chiral patterns on a centimeter scale. Indeed, the advance of a
left-handed screw at the frontier would result in sector boundaries twisting counterclockwise when viewed from
the bottom of a Petri dish, as we see in Fig. 5A.

To characterize the randomness of domain boundaries motion, we compute the variance
of $\varphi(r)$ from Eq. (8):

$$ \mathrm{Var}(\varphi) = \frac{2 D_s}{v_\parallel} \left( \frac{1}{r_i} - \frac{1}{r} \right). \qquad (10) $$

As $r \to \infty$, the variance of $\varphi$ reaches a finite limit, which is consistent with the prediction of
Eq. (4) that the coalescence of domain boundaries ceases at long times. The predicted inverse
dependence of $\mathrm{Var}(\varphi)$ on $r$ can be used to check that domain boundaries behave as circular
random walks and to estimate $D_s$ from the experimental data. Indeed, $2D_s/v_\parallel$ is the slope of
the line $\mathrm{Var}(\varphi)$ vs. $1/r_i - 1/r$.
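The two slope-based estimates described above (the chiral ratio from the mean angle and $D_s/v_\parallel$ from the variance) can be sketched as a pair of straight-line fits. The function name, the synthetic trajectories, and all parameter values below are hypothetical; real data would additionally require the binning and boundary-tracing steps described in Methods.

```python
import numpy as np


def fit_boundary_statistics(r, phi, r_i):
    """Estimate v_perp/v_par and D_s/v_par from boundary trajectories.

    r   : radii at which angles were recorded, shape (n_r,)
    phi : angles relative to their value at r_i, shape (n_walks, n_r)
    """
    r = np.asarray(r, dtype=float)
    mean_phi = phi.mean(axis=0)
    var_phi = phi.var(axis=0, ddof=1)
    # Mean angle is linear in ln(r / r_i) with slope v_perp / v_par.
    chirality = np.polyfit(np.log(r / r_i), mean_phi, 1)[0]
    # Variance is linear in 1/r_i - 1/r with slope 2 D_s / v_par.
    ds_over_v = 0.5 * np.polyfit(1.0 / r_i - 1.0 / r, var_phi, 1)[0]
    return chirality, ds_over_v


# Synthetic trajectories: independent Gaussian increments whose mean and
# variance follow the predicted radial dependence (made-up parameters).
rng = np.random.default_rng(1)
r_i, true_chirality, true_ds_over_v = 1000.0, 0.3, 40.0
r = np.linspace(r_i, 5000.0, 41)
mean_steps = true_chirality * np.diff(np.log(r / r_i))
var_steps = 2.0 * true_ds_over_v * np.diff(1.0 / r_i - 1.0 / r)
steps = mean_steps + np.sqrt(var_steps) * rng.standard_normal((3000, r.size - 1))
phi = np.concatenate([np.zeros((3000, 1)), np.cumsum(steps, axis=1)], axis=1)

chirality, ds_over_v = fit_boundary_statistics(r, phi, r_i)
print(f"v_perp/v_par ~ {chirality:.3f}, D_s/v_par ~ {ds_over_v:.1f}")
```

The variance fit here uses an unweighted least-squares line; with real data the points at large $r$ are noisier, so a weighted or truncated fit may be preferable.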

The annihilating random walks of sector boundaries and ancestral lineages provide an
intuitive explanation of genetic demixing, but a quantitative description of genetic drift in
bacterial colonies is needed to analyze evolutionary experiments done on Petri dishes as precisely
as those done in liquid culture. The experiments and simulations presented here were designed
to test whether the dynamics of bacterial colonies could be accurately described by Eqs. (4)
and (8) by checking their predictions: Eqs. (7), (9), and (10).



Results 

We performed experiments and on- and off-lattice simulations (see Fig. 1) to test whether
our models accurately describe the development of spatio-genetic correlations during range 
expansions. In the on-lattice simulations, range expansions are driven by migration into 
unoccupied territories followed by growth to saturation, similar to the range expansions of 
natural populations over long distances. In contrast, the off-lattice simulations were designed 
to mimic microbial colonies whose range expansions are driven by the mechanical forces that 
cells exert on each other as they grow. The utility of the simulations is threefold. First, they 
allow us to confirm that it is indeed possible to describe two-dimensional expansions with a 
one-dimensional model. Second, the simulations show that spatial genetic demixing is a generic 
phenomenon occurring in populations with very different biological properties and therefore 
growth dynamics. This in turn enables us to test whether our phenomenological model can 
accurately describe genetic demixing in these quite different populations in addition to our 
experiments. Third, the simulations allow us to explore whether sector boundaries are affected
by roughening of the expansion front (Hallatschek and Nelson, 2010; Kardar et al., 1986) for
parameter values of interest to us here.

For both experiments and simulations, we found that the expansion velocity remained
constant during the time of observation, see Fig. 3. The values of the expansion velocities
are given in Table 1. We also found that the motility and growth of the cells behind the
front were significantly reduced, and the sectoring pattern did not change in the interior of
the colony. Therefore, the sectoring pattern was a record of the genetic composition of the
colony front throughout the expansion: A ring of radius $r > R_0$ around the center of the colony
represents the state of the colony front at time $t = (r - R_0)/v_\parallel$. This mapping of time onto
radius allowed us to relate the time dependence of any variable in our one-dimensional model
to the two-dimensional pattern in a colony.

A very direct way to verify the theory is to compare the solution of Eq. (4) to $H(t, \phi)$
obtained in simulations and experiments, see Fig. 2. We found qualitative agreement between
the experiments, simulations, and the theory, but quantitative comparisons could not be made
because we were unable to reliably correlate the fluorescence intensity to the fraction of the
corresponding cells at a given pixel. Therefore, we focused on the two variables in the model
that can be examined quantitatively: the random-walk-like motion of the sector boundaries $\varphi(r)$
and the number of surviving sectors at long times $\mathcal{N}(R_0)$. For both variables, we first confirmed
that the experimental data were consistent with the corresponding analytical dependence. After
this, we estimated all four model parameters, $D_s$, $D_g$, $R_0$, and $v_\parallel$, and checked that different
estimates agree.



Lateral movement of genotypes. 

By tracing a large number of sector boundaries $\varphi(r)$, we verified that their trajectories had
the statistical properties of random walks in the circular geometry. For P. aeruginosa colonies,
the average angular positions of sector boundaries remained constant, indicating equal fitness of
the strains; see Fig. 4. E. coli colonies experienced chiral growth, and the boundaries twisted
counterclockwise on average; see Fig. 5A. This twisting [also observed in (Hallatschek et al.,
2007)] was consistent with constant lateral velocity $v_\perp$ and Eq. (9) as shown in Fig. 5B. We also
analyzed the deviations of sector boundaries from their average position. As shown in Fig. 5C,
the variance of $\varphi(r)$ was consistent with the random walk behavior in the circular geometry
summarized by Eq. (10).



Genetic diversity at the population front. 

Our model of genetic demixing during bacterial range expansions relies on three assumptions:
equal fitness of the strains, expansion with a constant velocity, and diffusive migrations at the
colony front. Since these assumptions were confirmed by the experimental data, we turned to
the number of surviving sectors given by Eq. (7), which relies not only on the aforementioned
assumptions but also on the solution of the model. We found that our one-dimensional model
provided a good description of two-dimensional simulations (Fig. 6B and C). The experimentally
observed $\mathcal{N}(R_0)$ was also consistent with the predicted square-root dependence for both E. coli
and P. aeruginosa, as one can see in Fig. 6A.



Quantitative test of the models. 

The agreement between the experiments and the theory not only suggests that bacterial 
expansions can be described by the one-dimensional stepping-stone model, but also allows 
us to put this agreement to a quantitative test. 

From the fit of Eq. (7) to the dependence of sector number on the initial population
radius, we estimate the two key parameters of the model, $D_s/v_\parallel$ and $D_g/v_\parallel$. The estimate
of $D_s/v_\parallel$, the degree of lateral movement by the genotypes as the population expands, forms
a prediction that allows us to test the ability of Eq. (4) to capture the process of population
expansion and genetic drift in the bacteria. This test is performed by taking the predicted value
of $D_s/v_\parallel$ and comparing it to a direct estimate that is made using our measurements of the sector
boundary trajectories. Specifically, the diffusion constant of random walks performed by sector
boundaries was calculated by fitting the observed variance of $\varphi(r)$ to the expected dependence
for a random walk in the circular geometry given by Eq. (10), see Fig. 5C. The parameters
estimated by the two methods are summarized in Table 1. The quantitative agreement between
these quite different estimates of $D_s/v_\parallel$ suggests that our spatial population genetics model can
indeed describe the simple bacterial range expansions discussed here.

Although it is beyond the scope of this paper to quantitatively relate the microscopic
properties of cells and colonies to the macroscopic parameters of our phenomenological model,
we can nevertheless infer some information about the processes within the
colony from the measured values of $D_s$ and $D_g$. To make this connection, we need to know the
cell size $a_c$ and the generation time $\tau$. These parameters do not have well-defined values because
cells are rod shaped and the generation time presumably depends on the location of a cell
relative to the front, where the nutrients are readily available. Therefore, we did not measure $a_c$
and $\tau$, but we believe that it is reasonable to assume that $a_c$ is on the order of 1 to 5 μm, and $\tau$
is on the order of $10^3$ to $10^4$ s (Yang et al., 2008; Bartlett et al., 1985). Given the wide range of
uncertainty and the differences in $a_c$ and $\tau$ between different experimental conditions, we can
only make very rough estimates, and the numbers given below should be considered accurate
within an order of magnitude. The velocity of expansion is given by $N_g a_c/\tau$, where $N_g$ is the
number of actively growing cells at the front (Hallatschek et al., 2007). From Table 1, $N_g$ is
on the order of ten cells, which is similar to the earlier results (Hallatschek et al., 2007). The
square root of the product of the diffusion constant and the generation time is proportional to
the root mean square lateral displacement of a cell during one generation. From the data, we
estimate that this lateral displacement is on the order of ten cell sizes, which is comparable to the
advancement of the front during the same period of time. As we show in Appendix A [Eq. (16)],
the strength of genetic drift $D_g$ can be expressed in terms of effective population density $\rho_e$; in
fact, $D_g = 1/(\rho_e \tau)$. We find that the effective number of cells in a spatial region of the size of a
cell, $a_c \rho_e$, is smaller or much smaller than one, as has been predicted by Hallatschek and Nelson
(2008).
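The order-of-magnitude estimates in this paragraph can be reproduced with a few lines of arithmetic. The sketch below uses the central E. coli values from Table 1; the cell size `a_c` and generation time `tau` are assumed values picked from within the ranges quoted above, and the function and key names are hypothetical. The outputs should be read as rough magnitudes only.

```python
import math


def front_estimates(v_par, ds_over_v, dg_over_v, a_c, tau):
    """Rough microscopic estimates; lengths in um, times in s."""
    D_s = ds_over_v * v_par              # um^2 / s
    D_g = dg_over_v * v_par              # um / s
    rho_e = 1.0 / (D_g * tau)            # from D_g = 1 / (rho_e * tau)
    return {
        # from v_par = N_g * a_c / tau
        "growing_cells": v_par * tau / a_c,
        # sqrt(D_s * tau), proportional to the rms lateral displacement
        "lateral_displacement_in_cells": math.sqrt(D_s * tau) / a_c,
        "front_advance_in_cells": v_par * tau / a_c,
        "effective_cells_per_cell_size": a_c * rho_e,
    }


est = front_estimates(
    v_par=8.3e-3,    # Table 1, E. coli
    ds_over_v=40.0,  # Table 1, E. coli
    dg_over_v=0.75,  # Table 1, E. coli
    a_c=2.0,         # ASSUMED cell size (um)
    tau=2400.0,      # ASSUMED generation time (s)
)
for name, value in est.items():
    print(f"{name}: {value:.2g}")
```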



Conclusions 

Expanding bacterial colonies are subject to strong genetic drift, resulting in local extinctions 
of the genotypes present in the population. These local extinctions manifest themselves in 
the formation of mono-allelic sectors, provided the organisms migrate very slowly behind the 
expansion front. Despite the potential for complexity in the physical and biotic interactions 
within bacterial groups, our experiments strongly suggest that these boundaries behave as 
annihilating random walks, and we show how to estimate their diffusion coefficients from the 
fluorescent images of spatial genetic demixing (see Methods). 

The spatial separation of two alleles in an expanding bacterial colony can also be described 
by a generalization of the stepping-stone model, which accounts not only for the behavior 
of the sector boundaries but also for their formation from the initially well-mixed population. 
Although the expansion takes place on a two-dimensional surface, our experiments and simulations 
strongly suggest that its effects on the genetic composition of the population front can be 
described by a simple one-dimensional model with only four parameters ($D_s$, $D_g$, $R_0$, and $v_\parallel$),
all of which can be measured experimentally as described in Methods. 

This generalization of the stepping-stone model not only explains spatial genetic demixing
but also characterizes neutral evolutionary dynamics of expanding populations quantitatively,
just as the Wright-Fisher and Moran models capture essential aspects of the dynamics of
stationary well-mixed populations. This quantitative description and the methods of estimating
the model parameters could be important for understanding evolution within microbial communities.
For example, the survival probability of a beneficial mutation depends on $D_g$, characterizing
the strength of genetic drift in spatially extended populations (Maruyama, 1970; Doering et al.,
2003; Korolev et al., 2010), and the evolution of cooperation depends on the relatedness (or
assortment) of the organisms, which is described by $H(t, \phi)$ (Nadell et al., 2010, 2009). In
addition, a better understanding of bacterial range expansions could also facilitate the development
of new methods of genetic inference from spatially resolved data.

More generally, we believe that a key contribution of this study is to illustrate the potential 
of spatial population genetic models to make quantitative predictions. Clearly, our bacterial 
system is not representative of all species and is a system that lends itself to making measurements 
that will not always be possible in other species. Nevertheless, our study demonstrates the 
potential for these models to generate accurate quantitative predictions as well as confirming 
the utility of population genetics as a method for evolutionary analysis. 

Appendix A: Genetic drift in expanding populations

In this Appendix, we briefly outline the theory leading to Eq. (4) and discuss the relationship
between $D_g$ and effective population size $N_e$. Because the number of offspring fluctuates, allele
frequencies change stochastically. In the classic Wright-Fisher model of a haploid population
with two neutral alleles at a locus, the dynamics of the frequency of one of the alleles, $f(t)$, is
governed by the following stochastic differential equation

$$ df(t) = \sqrt{\frac{f(t)[1 - f(t)]}{N \tau}}\, dB(t), \qquad (11) $$



where $B(t)$ is a Wiener process, and $N$ is the population size. The generation time $\tau$ is shown
explicitly to facilitate the transition to the spatial model later. For other population models,
e.g. the Moran model, Eq. (11) still describes the dynamics of $f(t)$, but with a different
coefficient inside the square root (Kingman, 1982; Wakeley, 2008). It is then convenient to
introduce an effective population size $N_e$ instead of $N$ so that Eq. (11) is satisfied. Under
certain assumptions, the effective population size equals the ratio of census population size
to the variance in the number of offspring (Kingman, 1982). This choice of $N_e$ is sometimes
called coalescent effective population size (Wakeley, 2008) or eigenvalue effective population
size (Ewens).

From stochastic differential equation (11), we can compute moments (e.g., average or
variance) using Ito's lemma (Risken, 1989; Gardiner, 1985). For example, the average heterozygosity
$H(t) = E[2f(t)[1 - f(t)]]$ obeys the following equation

$$ \frac{d}{dt} H(t) = -\frac{1}{N_e \tau} H(t), \qquad (12) $$

describing the loss of genetic diversity due to genetic drift.
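The exponential loss of heterozygosity in Eq. (12) can be checked against a direct simulation. The sketch below uses the discrete-generation Wright-Fisher model (binomial resampling of $N$ haploid individuals each generation), for which the expected heterozygosity shrinks by a factor $1 - 1/N$ per generation; this is the discrete counterpart of Eq. (12) with $N_e = N$ and time measured in generations. The function name and parameter values are illustrative assumptions.

```python
import numpy as np


def wright_fisher_heterozygosity(N, generations, n_replicates=20000,
                                 f0=0.5, seed=0):
    """Average heterozygosity E[2 f (1 - f)] over replicate populations."""
    rng = np.random.default_rng(seed)
    counts = np.full(n_replicates, int(round(f0 * N)))
    H = np.empty(generations + 1)
    for g in range(generations + 1):
        f = counts / N
        H[g] = np.mean(2.0 * f * (1.0 - f))
        # Next generation: each of the N offspring picks a random parent.
        counts = rng.binomial(N, f)
    return H


N, generations = 50, 100
H = wright_fisher_heterozygosity(N, generations)
t = np.arange(generations + 1)
H_discrete = H[0] * (1.0 - 1.0 / N) ** t   # exact expectation, discrete model
H_continuous = H[0] * np.exp(-t / N)       # solution of Eq. (12), tau = 1
print(f"simulated H(100) = {H[-1]:.4f}")
print(f"discrete theory  = {H_discrete[-1]:.4f}")
print(f"Eq. (12)         = {H_continuous[-1]:.4f}")
```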

The one-dimensional stepping-stone model is a generalization of Eq. (11) to an array of
demes labeled by an index $k$ (Kimura and Weiss, 1964; Barton et al., 2002):

$$ df_k(t) = \frac{m}{2\tau} [f_{k-1}(t) - 2 f_k(t) + f_{k+1}(t)]\, dt + \sqrt{\frac{1}{N_e \tau} f_k(t)[1 - f_k(t)]}\, dB_k(t), \qquad (13) $$

where $m$ is the fraction of organisms migrating out of each deme into one of the two nearest
neighbors. We can obtain a continuous in space description of this lattice model by introducing
a spatial variable $x = ka$, where $a$ is the distance between the demes. The corresponding
stochastic differential equation reads (Korolev et al., 2010)

$$ df(t, x) = D_s \frac{\partial^2}{\partial x^2} f(t, x)\, dt + \sqrt{D_g f(t, x)[1 - f(t, x)]}\, dB(t, x), \qquad (14) $$

where

$$ D_s = \frac{m a^2}{2 \tau}, \qquad (15) $$

and

$$ D_g = \frac{a}{N_e \tau}. \qquad (16) $$



Note that the factor of $a$ in the definition of $D_g$ arises from the normalization of the spatial
Wiener process $B(t, x)$. The ratio $\rho_e = N_e/a$ is the effective population density, which appears
in the continuous formulations of the stepping-stone model (Nagylaki, 1974). When Eq. (14) is
applied not to strictly one-dimensional populations, but to linear expansion fronts, $D_g$ depends
on many demographic parameters in an intricate way (Hallatschek and Nelson, 2008).
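A discrete-generation version of the stepping-stone model is easy to simulate and shows the local loss of diversity that the continuum description captures. The sketch below alternates nearest-neighbor migration (a fraction $m/2$ exchanged with each neighbor) with Wright-Fisher resampling of $N$ individuals in every deme. Periodic boundaries, the parameter values, and the function name are assumptions made for illustration; there is no range expansion in this sketch.

```python
import numpy as np


def stepping_stone(n_demes=200, N=20, m=0.2, generations=200, seed=0):
    """Simulate allele frequencies on a ring of demes.

    Returns the final frequencies and the deme-averaged local
    heterozygosity 2 f (1 - f) recorded at each generation.
    """
    rng = np.random.default_rng(seed)
    f = np.full(n_demes, 0.5)                  # well-mixed initial state
    local_H = np.empty(generations + 1)
    local_H[0] = np.mean(2.0 * f * (1.0 - f))
    for g in range(1, generations + 1):
        # Migration: each deme sends a fraction m/2 to each neighbor.
        f = (1.0 - m) * f + 0.5 * m * (np.roll(f, 1) + np.roll(f, -1))
        f = np.clip(f, 0.0, 1.0)               # guard against round-off
        # Genetic drift: binomial resampling of N individuals per deme.
        f = rng.binomial(N, f) / N
        local_H[g] = np.mean(2.0 * f * (1.0 - f))
    return f, local_H


f_final, local_H = stepping_stone()
fixed = np.mean((f_final == 0.0) | (f_final == 1.0))
print(f"local heterozygosity: {local_H[0]:.2f} -> {local_H[-1]:.2f}")
print(f"fraction of monoallelic demes: {fixed:.2f}")
```

With these conventions one generation corresponds to $\tau = 1$ and one deme spacing to $a = 1$, so Eqs. (15) and (16) give $D_s = m/2$ and $D_g = 1/N$.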

For circular expansions of interest to us here, it is convenient to introduce polar coordinates
$R(t) = R_0 + v_\parallel t$ and $\phi$ around the center of the expansion. Equation (14) then takes the following form

$$ df(t, \phi) = \frac{D_s}{[R(t)]^2} \frac{\partial^2}{\partial \phi^2} f(t, \phi)\, dt + \sqrt{\frac{D_g}{R(t)} f(t, \phi)[1 - f(t, \phi)]}\, dB(t, \phi). \qquad (17) $$

To obtain Eq. (4), we use Ito's lemma to differentiate $H(t, \phi)$ defined by Eq. (3) in terms
of $f(t, \phi)$.

Table 1: Measured values of the model parameters

Experiment               $v_\parallel$ (μm/s)      $D_g/v_\parallel$      $D_s/v_\parallel$ (μm)    $D'_s/v_\parallel$ (μm)
E. coli                  (8.3 ± 0.4) · 10^-3       0.75, (0.32, ∞)        40, (10, 90)              32 ± 2
P. aeruginosa at 37°C    (17.3 ± 0.9) · 10^-3      ∞, (3, ∞)              4.6, (4.0, 5.2)           3.7 ± 1.1
P. aeruginosa at 21°C    (6.9 ± 0.3) · 10^-3       0.17, (0.10, 0.77)     5.5, (1.2, 13.0)          5.2 ± 0.5

Expansion velocities, migration rates, and the strengths of genetic drift. $D_s$ and $D'_s$ are two
different estimates of the diffusion constant; their agreement indicates quantitative validity of
our model. The values of $v_\parallel$ were obtained by fitting $R(t)$ to a linear function, see Fig. 3. $D_s$
and $D_g$ were estimated from the fit of the data shown in Fig. 6 to Eq. (7). For $D_g/v_\parallel$ and $D_s/v_\parallel$,
the 95% confidence intervals are given in parentheses. The infinite values of $D_g/v_\parallel$ are due to
the possibility of the linear fit (see Fig. 6A) passing through the origin. $D'_s$ are measured
from the random walks of sector boundaries [see Eq. (10)], and the error estimates represent
the standard deviation between different repetitions of the experiments. See Methods for a
detailed description of the estimating procedures.
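One simple way to read the agreement between the two estimates of the diffusion constant is to ask whether the interval around each direct estimate overlaps the 95% confidence interval of the fitted one. The sketch below applies that check to the Table 1 values. The overlap criterion, the use of one standard deviation, and all names are illustrative choices made here, not the statistical procedure of the study.

```python
# D_s / v_par from the sector-number fit: 95% confidence interval (um).
# D'_s / v_par from boundary random walks: (mean, standard deviation) (um).
TABLE_1 = {
    "E. coli": {"fit_ci": (10.0, 90.0), "walk": (32.0, 2.0)},
    "P. aeruginosa at 37C": {"fit_ci": (4.0, 5.2), "walk": (3.7, 1.1)},
    "P. aeruginosa at 21C": {"fit_ci": (1.2, 13.0), "walk": (5.2, 0.5)},
}


def intervals_overlap(fit_ci, mean, sd, n_sd=1.0):
    """True if [mean - n_sd*sd, mean + n_sd*sd] intersects fit_ci."""
    low, high = mean - n_sd * sd, mean + n_sd * sd
    return low <= fit_ci[1] and fit_ci[0] <= high


results = {
    name: intervals_overlap(row["fit_ci"], *row["walk"])
    for name, row in TABLE_1.items()
}
for name, ok in results.items():
    print(f"{name}: {'consistent' if ok else 'inconsistent'}")
```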

Figure 1: Genetic demixing in experiments (A), on-lattice simulations (B), and off-lattice
simulations (C). Different colors label different alleles, and the initial mixing ratio was 1:1. (A)
The Petri dish was inoculated with a drop (bounded by the white circle) of a mixture of Pseudomonas
aeruginosa cells labeled with two different fluorescent markers that do not affect the relative fitness of
the cells. As the drop dries out and the colony starts to expand (shown with arrows), the population
at the front of the expansion demixes (separates) into sectors of different colors. This genetic
demixing is due to the loss of local genetic diversity during the expansion, see (Hallatschek et al.,
2007; Hallatschek and Nelson, 2010; Korolev et al., 2010). This plate was incubated at 37°C. (B) The
habitat was a 1000 × 1000 array of islands arranged on a square lattice. Each island could harbor
at most $N = 30$ individuals, consistent with the width of the layer of actively growing cells found
in (Hallatschek et al., 2007). The simulation was started with a well-mixed population occupying a
disk of radius $R_0 = 80$ lattice spacings in the center of the habitat. (C) The simulation was initiated
with 1024 agents placed at random within a circular inoculant in the center of the habitat. The boxed
region of the colony is shown in the inset at a higher magnification. We attribute the slightly square
shape of the colony to the underlying square grid used to solve the nutrient reaction-diffusion equation.



[Figure 2 plot residue. Legend of panel B: r = 80, 90, 105, 135, 255, 450. Legend of panel D: r = (5/30)R_f, (10/20)R_f, (20/30)R_f.]



Figure 2: Plots of average spatial heterozygosity as a function of angle during range expansions
from 1:1 well-mixed populations in our model (A), on-lattice simulations (B), off-lattice simulations
(C), and experiments (D). (A) Solution of Eq. (4) with $D_s = 1$, $D_g = 1$, $v_\parallel = 1$, and $R_0 = 1$ at various
times $t$. Note that there is no significant difference between $H(500, \phi)$ and $H(1000, \phi)$ because $H(t, \phi)$
reaches a nontrivial limit-shape as $t \to \infty$. (B) $H(t, \phi)$ from 24 on-lattice simulations with the same
parameters as in Fig. 1B, except $N = 300$. In agreement with (A), we see a gradual decrease of $H(t, 0)$
with time. The radius $r = v_\parallel t + R_0$ is in a direct correspondence with time $t$. (C) $H(t, \phi)$ at the
expansion frontier from 10 off-lattice simulations, as in Fig. 1C. Because off-lattice simulations model
a monolayer of cells, any spatial point in a colony has a unique genetic state; hence $H(t, 0) = 0$.
(D) The average spatial heterozygosity was calculated from 8 E. coli colonies inoculated with 3 μl of
the bacterial mix of cells. As in (B), radius $r$ is directly related to time $t$ because the spatio-genetic
pattern is a frozen record of the genetic composition of the front. The dip in $H(t, \phi)$ widens with time
in agreement with Eq. (4). Similar to (C), $H(t, 0) = 0$ because we assume that $f(t, \phi)$ is either zero or
one at every pixel. This assumption is only valid after the initial fixation time discussed in the main
text.

Figure 3: Linear increase of colony radii with time. The radii of three colonies of P. aeruginosa grown
at 21°C are plotted as functions of time. The three sets of data correspond to three different
inoculation volumes: 1 μl (red dots), 3 μl (green squares), and 14 (blue stars). The solid lines are
the least square fits to straight lines. For all three colonies, the radius increased linearly with time
and the rate of the increase was approximately independent of the initial size of the colony, as would
be the case under high nutrient conditions. Note that, for the largest colony, the expansion seems to
slow down slightly at the end, possibly due to nutrient depletion. Similar behavior was observed for
other colonies of P. aeruginosa and E. coli for all growth conditions studied. The obtained values of
the expansion velocities are given in Table 1. The inset shows a fluorescent image of a P. aeruginosa
colony grown at 21°C; fluorescence from only one of the two alleles is shown.

Figure 4: Neutrality of P. aeruginosa strains at 37°C. The gray dots represent individual
measurements along the sector boundaries at pixel resolution (one pixel is 50 μm) from 24
colonies (524 sector boundaries) of P. aeruginosa. Positive $\varphi$ represents boundary bending from
cyan towards yellow sectors. Each red dot is the mean position of the gray dots in one of the 10
equidistant bins along the x-axis. The error bars represent the 95% confidence interval of the mean
values. The green line is the expected dependence if the strains are neutral, $E[\varphi(r)] = 0$. Since
the expected line passes through all of the error bars, we cannot reject the hypothesis that the
strains are neutral. In fact, the largest fitness difference consistent with the data is about 10~^. 
This comes from using Eq. (9) to find $v_\perp/v_\parallel$ and relating $v_\perp/v_\parallel$ to the fitness difference as
in (Hallatschek and Nelson, 2010; Korolev, 2010). It is then reasonable to conclude that these
strains can be considered equally fit on the time scales of our experiments. The corresponding
data for P. aeruginosa strains grown at 21°C is also consistent with the assumption that the
strains are equally fit. E. coli strains are also equally fit, see Fig. 5 and (Hallatschek et al.,
2007).

Figure 5: Chirality and boundary wandering in E. coli colonies. (A) Genetic demixing in a
chiral E. coli colony. The image, taken from the bottom of the Petri dish, shows the fluorescent
signal from only one of the two segregating alleles. Bacterial colonies of P. aeruginosa at 37°C
and 21°C did not exhibit chiral growth. (B) Twisting of sector boundaries in E. coli colonies.
The gray dots represent individual measurements along the sector boundaries at pixel resolution
(one pixel is 50 μm) from 30 colonies (390 sector boundaries). Each red dot is the mean position
of the gray dots in one of the 50 equidistant bins along the x-axis. The green line is the least
square fit to the red dots. According to Eq. (9), the slope of the green line equals $v_\perp/v_\parallel$, which
yields $v_\perp/v_\parallel = 0.32$. As shown in (A) and (B), all sector boundaries twist on average in the same
direction for chiral growth. In contrast, sector boundaries between non-neutral strains bend in
both clockwise and counterclockwise directions because the boundaries around the fitter strains
bend outwards (Hallatschek and Nelson, 2010). To exclude possible small fitness differences,
we also separately analyzed boundaries that would bend in opposite directions if the strains
were non-neutral. Non-neutrality would result in different values of $v_\perp/v_\parallel$ for these two sets of
boundaries. Upon applying equivalent tests to those used in Fig. 4, we confirmed that $v_\perp/v_\parallel$ are
the same, which supports the previous finding that these strains are neutral (Hallatschek et al.,
2007). (C) Random walks of sector boundaries. The same data as in (B) is used to plot the
variance of $\varphi(r)$; gray dots are the individual data points, and the red dots are the averages.
The green line is the least square fit to the first 25 red dots; the last 25 dots are not used
because of large fluctuations due to a smaller sample size at large $r$. According to Eq. (10),
the slope of the green line equals $2D_s/v_\parallel$. The data sets for P. aeruginosa at 37°C and 21°C
show similar behavior of the variance of $\varphi(r)$, but the data set for E. coli fluctuates less partly
because of the larger sample size and larger spatial diffusion constant compared to the other
two experiments.

Figure 6: Dependence of the diversity at the expansion front on the initial size of the population in
experiments (A), on-lattice simulations (B), and off-lattice simulations (C). The average number of
sectors at the end of a range expansion $\mathcal{N}$ is plotted versus the square root of the initial radius of the
colony $R_0$. These coordinates are chosen so that the theoretically predicted dependence [see Eq. (7)]
is a straight line (solid lines). The error bars represent the 95% confidence interval. Parameters $D_s/v_\parallel$
and $D_g/v_\parallel$ can be estimated from the slope of the fit and its intercept with the y-axis respectively.
(A) E. coli (black squares), P. aeruginosa at 37°C (red triangles), and P. aeruginosa at 21°C (blue
dots). (B) Three different island carrying capacities: $N = 3$ (black squares), $N = 30$ (red triangles),
and $N = 300$ (blue dots). Each data point is the average of 40 simulations. Notice that the populations
with higher $N$ have more sectors in agreement with Eq. (3) because populations with higher densities
have lower $D_g$. In this plot, the error bars are represented by the size of the markers used. (C) Due to
the discrete nature of these simulations, we use the square root of the number of cells at inoculation
as a proxy for the initial radius.